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Draft Genome Sequence of the Hydrogen- and Ethanol-Producing 
Bacterium Clostridium intestinale Strain URNW 
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Here, we report the draft genome sequence of Clostridium intestinale strain URNW, which can convert biomass to useful prod- 
ucts such as biofuels (hydrogen or ethanol) and other soluble end products. 
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Clostridium intestinale strain URNW is a Gram-positive, meso- 
philic, anaerobic spore-forming bacterium (1). Its optimum 
growth temperature is 37°C, but it can grow at a range of temper- 
atures from 22°C to a maximum of 45°C and has displayed a 
doubling time of 1.5 h (1). Nucleotide sequence analyses of Clos- 
tridium intestinale URNW 16S rRNA genes and chaperonin 60 
(cpn60) genes revealed a 98% identity with C. intestinale ATCC 
49213 (1, 2). Phylogenetic analyses based on 16S rDNA and cpn60 
nucleotide sequences indicated that C. intestinale URNW clus- 
tered with butyrate-producing hydrogen producers, such as C. in- 
testinale, C. acetobutylicum, C. perfringens, C. butyricum, and 
C. beijirinckii (1). 

The genome of C. intestinale strain URNW was sequenced by 
the McGill University and Genome Quebec Innovation Centre by 
use of the paired-end and shotgun protocols of the Roche/454s 
GS-FLX Titanium system and the paired-end protocol of the Illu- 
mina Hiseq 2000 platform. The raw reads of the 454 and Illumina 
were first preprocessed with GS2.6 and Trimmomatic-0.22 (3) for 
adapter clipping and quality trimming before assembly. After pre- 
processing, mixed paired-end and unpaired Illumina short reads 
and 454 long reads were pooled for assembly with the Velvet (4) 
assembler. Following Velvet assembly, further scaffolding and gap 
filling were performed with SSPACE v2.0 (5) and GapFiller v 1 . 1 1 
(6). The final draft genome assembly has coverages of -600 for 
Illumina reads and -28 for 454 reads and contains 39 contigs with 
a total size of 4,67 1 , 1 78 bp, an N 50 contig length of 35 1 ,5 14 nucle- 
otides, and a mean G+C content of 30.2%. The average contig 
length of is 119,774 bp and the longest contig length is 
1,533,141 bp. 

The draft genome assembly was submitted to NCBI WGS ge- 
nome sequencing project and annotated with the NCBI Prokary- 
otic Genome Annotation Pipeline (PGAP). The NCBI PGAP an- 
notation GenBank file was uploaded to the web service of the Gene 
Prediction Improvement Pipeline (GenePRIMP) (7) developed 
by JGI for analysis of gene anomalies. Based on the report of Ge- 
nePRIMP anomalies, manual curation was done using Artemis 
(8). The final annotation of the draft genome sequence of C. in- 



testinale strain URNW has a total of 4,660 genes, including 4,367 
protein-coding genes, 182 pseudogenes, 83 tRNAs, and 28 rRNAs. 

C. intestinale strain URNW protein-coding genes were verified 
using other Clostridium species as reference organisms. Amino 
acid sequences for each gene product were retrieved from the JGI 
genome portal (http://genome.jgi-psf.org/) and the NCBI data- 
base (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi), and se- 
quence alignments against C. intestinale strain URNW genes were 
performed. Key enzymes involved in core metabolism were iden- 
tified using an in silico approach, which revealed that C. intestinale 
strain URNW is a potential candidate for the production of bio- 
fuels such as hydrogen or ethanol. All 39 contigs were analyzed by 
the online software IslandViewer and did not show any genes as- 
sociated with pathogenicity (9). 

Nucleotide sequence accession numbers. The genome se- 
quence of Clostridium intestinale strain URNW has been depos- 
ited at DDBJ/EMBL/GenBank under the accession no. 
APJA00000000. The version described in this paper is the first 
version, accession no. APJA01000000. 
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